A General Method for Multi-Agent Reinforcement Learning in Unrestricted Environments

Authors

  • Jürgen Schmidhuber
  • IDSIA Corso
Abstract

Previous approaches to multi-agent reinforcement learning are either very limited or heuristic by nature. The main reason is that each agent's environment continually changes because the other agents keep changing. Traditional reinforcement learning algorithms cannot properly deal with this. This paper, however, introduces a novel, general, sound method for multiple reinforcement learning agents living a single life with limited computational resources in an unrestricted environment. The method properly takes into account that whatever some agent learns at some point may affect learning conditions for other agents, or for itself, at any later point. It is based on an efficient, stack-based backtracking procedure called environment-independent reinforcement acceleration (EIRA), which is guaranteed to make each agent's learning history a history of performance improvements (long-term reinforcement accelerations). The principles have been implemented in an illustrative multi-agent system, where each agent is in fact just a connection in a fully recurrent reinforcement learning neural net.
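The abstract gives no algorithmic detail, so the following is only a rough Python sketch of what a stack-based backtracking rule of this kind might look like. Everything in it is an assumption made for illustration: the names `Checkpoint`, `Agent`, `modify`, and `evaluate` are hypothetical, and the acceptance test (keep a modification only if the reward per time since it was made exceeds the reward per time since the previous surviving modification) is one plausible reading of "long-term reinforcement acceleration", not the paper's stated procedure.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Checkpoint:
    """Hypothetical stack entry: when a change was made and how to undo it."""
    time: float                                # time of the modification
    reward: float                              # cumulative reward at that time
    old_values: Dict[str, Optional[float]]     # overwritten values (None = key was absent)


@dataclass
class Agent:
    """Hypothetical agent whose policy changes are recorded on a stack."""
    policy: Dict[str, float] = field(default_factory=dict)
    stack: List[Checkpoint] = field(default_factory=list)

    def modify(self, time: float, reward: float, changes: Dict[str, float]) -> None:
        """Apply policy changes and push the information needed to undo them."""
        old = {key: self.policy.get(key) for key in changes}
        self.stack.append(Checkpoint(time, reward, old))
        self.policy.update(changes)

    def evaluate(self, now: float, total_reward: float) -> None:
        """Backtrack: undo the newest changes until the newest survivor pays off.

        Assumes `now` is strictly later than every checkpoint time and > 0.
        Only the top entry is compared with its predecessor (a simplification).
        """
        while self.stack:
            top = self.stack[-1]
            # Baseline: previous surviving checkpoint, or the start of life.
            if len(self.stack) > 1:
                base_time, base_reward = self.stack[-2].time, self.stack[-2].reward
            else:
                base_time, base_reward = 0.0, 0.0
            rate_since_top = (total_reward - top.reward) / (now - top.time)
            rate_since_base = (total_reward - base_reward) / (now - base_time)
            if rate_since_top > rate_since_base:
                break  # the newest surviving change accelerated reward intake
            # Otherwise undo the change and examine the next entry down.
            self.stack.pop()
            for key, value in top.old_values.items():
                if value is None:
                    self.policy.pop(key, None)
                else:
                    self.policy[key] = value


agent = Agent(policy={"w": 0.0})
agent.modify(time=10.0, reward=5.0, changes={"w": 1.0})
agent.evaluate(now=20.0, total_reward=15.0)   # reward rate rose: change is kept
agent.modify(time=20.0, reward=15.0, changes={"w": 2.0})
agent.evaluate(now=30.0, total_reward=17.0)   # reward rate fell: change is undone
```

In this sketch, each agent would hold its own stack, so a change that looked good when it was made can still be undone later if other agents' learning has since made it unprofitable.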

Similar resources

Voltage Coordination of FACTS Devices in Power Systems Using RL-Based Multi-Agent Systems

This paper describes how multi-agent system technology can be used as the underpinning platform for voltage control in power systems. In this study, some FACTS (flexible AC transmission systems) devices are properly designed to coordinate their decisions and actions in order to provide a coordinated secondary voltage control mechanism based on multi-agent theory. Each device here is modeled as ...

An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

The RoboCup competition, as a great test-bed, has become a popular worldwide domain in recent years. The main objective of such competitions is to deal with the complex behavior of systems which consist of multiple autonomous agents. The rich experience of human soccer players can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...

Incremental Self-improvement for Life-time Multi-agent Reinforcement Learning

Previous approaches to multi-agent reinforcement learning are either very limited or heuristic by nature. The main reason is: each agent's or "animat's" environment continually changes because the other learning animats keep changing. Traditional reinforcement learning algorithms cannot properly deal with this. Their convergence theorems require repeatable trials and strong (typically Markovia...

Improving Agent Performance for Multi-Resource Negotiation Using Learning Automata and Case-Based Reasoning

In electronic commerce markets, agents often need to acquire multiple resources to fulfil a high-level task. In order to attain such resources they need to compete with each other. In multi-agent environments in which competition is involved, negotiation is an interaction between agents in order to reach an agreement on resource allocation and to coordinate with each other. In recent ...

Utilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs

Multi-agent Markov decision processes (MMDPs), as the generalization of Markov decision processes to the multi-agent case, have long been used for modeling multi-agent systems and serve as a suitable framework for multi-agent reinforcement learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDPs is proposed. In the proposed algorithm, MMDP ...

Publication date: 2002